48 research outputs found

Dinucleotide controlled null models for comparative RNA gene prediction

Author: A Coventry
A Rambaut
A Siepel
AM Pedersen
AV Uzilov
C del Val
C Lanave
C Weile
C Workman
D Karolchik
D Metzler
D Rose
DM Robinson
DR Forsdyke
E Rivas
E Torarinsson
G Lunter
I Miklós
IL Hofacker
J Felsenstein
J Jensen
J Thorne
J Thorne
K Missal
K Missal
L Duret
M Blanchette
M Hasegawa
M Schöniger
M Schöniger
O Gascuel
OF Christensen
P Clote
PF Arndt
R Backofen
R Fleißner
S Griffiths-Jones
S Griffiths-Jones
S Guindon
S Tavaré
S Washietl
S Washietl
S Washietl
S Washietl
S Washietl
SF Altschul
Stefan Washietl
T Babak
T Gesell
T Mourier
T Sandmann
Tanja Gesell
Y Van de Peer
Z Yao
Publication venue: BioMed Central
Publication date: 01/01/2008

Abstract. Background: Comparative prediction of RNA structures can be used to identify functional noncoding RNAs in genomic screens. It was shown recently by Babak et al. [BMC Bioinformatics. 8:33] that RNA gene prediction programs can be biased by the genomic dinucleotide content, in particular those programs using a thermodynamic folding model that includes stacking energies. As a consequence, there is a need for dinucleotide-preserving control strategies to assess the significance of such predictions. While randomization algorithms for single sequences have existed for many years, the problem has remained challenging for multiple alignments, and there is currently no algorithm available. Results: We present a program called SISSIz that simulates multiple alignments of a given average dinucleotide content. Meeting additional requirements of an accurate null model, the randomized alignments have, on average, the same sequence diversity and preserve local conservation and gap patterns. We make use of a phylogenetic substitution model that includes overlapping dependencies and site-specific rates. Using fast heuristics and a distance-based approach, a tree is estimated under this model and used to guide the simulations. The new algorithm is tested on vertebrate genomic alignments, and the effect on RNA structure predictions is studied. In addition, we directly combined the new null model with the RNAalifold consensus folding algorithm, giving a new variant of a thermodynamic structure-based RNA gene finding program that is not biased by the dinucleotide content. Conclusion: SISSIz implements an efficient algorithm to randomize multiple alignments while preserving dinucleotide content. It can be used to get more accurate estimates of false positive rates of existing programs, to produce negative controls for the training of machine learning-based programs, or as a standalone RNA gene finding program. Other applications in comparative genomics that require randomization of multiple alignments can be considered. Availability: SISSIz is available as open source C code that can be compiled for every major platform and downloaded here: http://sourceforge.net/projects/sissiz.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

DNA word analysis based on the distribution of the distances between symmetric words

Author: A Dayn
A Teixeira-Silva
B Powdel
DB Haniford
DR Forsdyke
DW Huang
DW Huang
ES Lander
G Benson
G Crooks
H Inagaki
H Zhang
J Kolb
J Qi
JC Fu
JC Venter
M Hackenberg
MS O’Bleness
RZ Cer
S Ding
S Levy
V Afreixo
V Afreixo
V Brázda
VN Potaman
Y Wang
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2017

We address the problem of discovering pairs of symmetric genomic words (i.e., words and the corresponding reversed complements) occurring at distances that are overrepresented. For this purpose, we developed new procedures to identify symmetric word pairs with uncommon empirical distance distribution and with clusters of overrepresented short distances. We speculate that patterns of overrepresentation of short distances between symmetric word pairs may allow the occurrence of non-standard DNA conformations, such as hairpin/cruciform structures. We focused on the human genome, and analysed both the complete genome as well as a version with known repetitive sequences masked out. We reported several well-defined features in the distributions of distances, which can be classified into three different profiles, showing enrichment in distinct distance ranges. We analysed in greater detail certain pairs of symmetric words of length seven, found by our procedure, characterised by the surprising fact that they occur at single distances more frequently than expecte

Crossref

Repositório Institucional da Universidade de Aveiro

Modeling rejection immunity

Author: A Humar
A Matzavinos
A Ruggeri
AJL Torres
AK Abbas
AL Taylor
Alice Matone
Andrea De Gaetano
Annamaria Agnes
DJ Post
DR Flower
DR Forsdyke
DS Gould
E Morelon
Francesco Ria
G Benichou
G Benichou
GF Franklin
I Cote
JW Yewdell
MA Nowak
N Safinia
P Marchetti
Pasquale Palumbo
PS Heeger
R Otton
S Faderl
S Marino
Sabina Magalini
SC Gonçalves
SH Bajaria
SM Ciupe
W Ke-Hua
WO Kermack
Publication venue: Springer Science and Business Media LLC

Crossref

Epigenetic Regulation of HIV-1 Latency by Cytosine Methylation

Human immunodeficiency virus type 1 (HIV-1) persists in a latent state within resting CD4+ T cells of infected persons treated with highly active antiretroviral therapy (HAART). This reservoir must be eliminated for the clearance of infection. Using a cDNA library screen, we have identified methyl-CpG binding domain protein 2 (MBD2) as a regulator of HIV-1 latency. Two CpG islands flank the HIV-1 transcription start site and are methylated in latently infected Jurkat cells and primary CD4+ T cells. MBD2 and histone deacetylase 2 (HDAC2) are found at one of these CpG islands during latency. Inhibition of cytosine methylation with 5-aza-2′-deoxycytidine (aza-CdR) abrogates recruitment of MBD2 and HDAC2. Furthermore, aza-CdR potently synergizes with the NF-κB activators prostratin or TNF-α to reactivate latent HIV-1. These observations confirm that cytosine methylation and MBD2 are epigenetic regulators of HIV-1 latency. Clearance of HIV-1 from infected persons may be enhanced by inclusion of DNA methylation inhibitors, such as aza-CdR, and NF-κB activators into current antiviral therapies.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

The Proteomic Code: a molecular recognition code for proteins

Author: A Bhakoo
AA Komar
B Benyo
C Levinthal
CB Anfinsen
CR Woese
CR Woese
D Naor
DA Weigent
DR Forsdyke
E Azarya-Sprinzak
E Neher
F Glaser
FHC Crick
G D'Onofrio
G Gamow
G Gamow
G Gamow
G Gamow
G Gamow
G Gamow
H Fan
H Okada
HM Berman
HM Berman
IA Adzhubei
IZ Siemion
IZ Siemion
IZ Siemion
J Biro
J Biro
J Biro
J Biro
Jan C Biro
JC Biro
JC Biro
JC Biro
JC Biro
JC Biro
JC Biro
JC Biro
JC Biro
JC Biro
JC Biro
JC Biro
JD Watson
JE Blalock
JE Blalock
JE Blalock
JE Blalock
JE Blalock
JE McGuigan
JE Zull
JE Zull
JG Omichinski
JR Heal
JR Heal
JR Heal
JR Heal
JT Wong
K Ikehara
K Ikehara
K Nord
KC Gokhale
KI Rother
KL Bost
KL Bost
L Baranyi
L Baranyi
L Baranyi
L Katz
L Pauling
L Pauling
L Pauling
LB Mekler
LB Mekler
M Eilers
M Oresic
M Zuker
ML Chiusano
MO Dayhoff
MS Singer
O Ermolaeva
RS Root-Bernstein
RS Root-Bernstein
S Brunak
S Walter
SD Seiwert
SK Gupta
T Junier
T Pawson
T Xie
TA Thanaraj
TS Kumarevel
U Segerstéen
W Gu
W Gu
W Seffens
WL Duax
Y Isogai
Y Shao
Publication venue: BioMed Central
Publication date: 01/01/2007

Abstract. Background: The Proteomic Code is a set of rules by which information in genetic material is transferred into the physico-chemical properties of amino acids. It determines how individual amino acids interact with each other during folding and in specific protein-protein interactions. The Proteomic Code is part of the redundant Genetic Code. Review: The 25-year history of this concept is reviewed from the first independent suggestions by Biro and Mekler, through the works of Blalock, Root-Bernstein, Siemion, Miller and others, followed by the discovery of a Common Periodic Table of Codons and Nucleic Acids in 2003 and culminating in the recent conceptualization of partial complementary coding of interacting amino acids as well as the theory of nucleic acid-assisted protein folding. Methods and conclusions: A novel cloning method for the design and production of specific, high-affinity-reacting proteins (SHARP) is presented. This method is based on the concept of proteomic codes and is suitable for large-scale, industrial production of specifically interacting peptides.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central